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ABSTRACT:  It  is  known  that  under  a  wide  variety  of  assumptions  a  database 
decomposition  is  lossless  if  and  only  if  the  database  scheme  has  a  lossless 
join.  Biskup,  Dayal,  and  Bernstein  have  shown  that  when  the  given  depen¬ 
dencies  are  functional  then  the  database  scheme  has  a  lossless  join  if  and 
only  if  one  of  the  relation  scheme  is  a  key  for  the  universal  scheme.  In 
this  note  we  supply  an  alternative  proof  of  that  characterization.  The 
proof  uses  tools  from  the  theory  of  embedded  join  dependencies  and  the 
theory  of  tuple  and  equality  generating  dependencies,  but  is,  nevertheless, 
much  simpler  than  the  previously  published  proof. 


1.  Introduction 


A  significant  portion  of  research  on  relational  database  theory  has  been  concerned  with 
the  properties  of  decompositions.  The  generic  problem  can  be  described  as  follows.  Given  a 
“universal”  relation  scheme  presented  as  a  set  of  attributes  and  a  set  of  dependencies,  what  are 
the  conditions  under  which  it  can  be  decomposed  into  a  collection  of  relation  schemes,  each 
with  its  own  sets  of  attributes  and  dependencies,  having  some  desired  properties.  The  proper¬ 
ties  considered  were,  at  first  various  normal  forms,  (see  [Ma,  Ul)). 

A  basic  assumption  underlying  these  ideas  is  that  when  a  universal  scheme  is  decomposed 
into  smaller  schemes,  each  of  the  universal  relations  associated  with  it  is  decomposed  into 
smaller  relations  using  the  projection  operation,  i.e.,  exh  such  relation  is  projected  onto  each 
one  of  the  smaller  schemes.  For  a  decomposition  to  be  useful,  it  should  be  lossless.  In  other 
words,  it  should  be  possible  to  reconstruct  the  universal  relations  from  their  projections.  The 
desirability  of  this  property  is  called  in  [BBG]  the  representation  principle. 

The  property  of  losslessness  has  been  studied  in  numerous  papers  [BR,MMSU,MUV,Va]. 
It  has  been  shown  there  that,  under  a  wide  variety  of  assumptions,  a  decomposition  is  lossless  if 
and  only  if  the  database  scheme  has  a  lossless  join.  When  the  given  dependencies  are  func¬ 
tional  the  following  characterization  of  database  schemes  that  have  lossless  join  is  given  in 
[BDB]:  a  database  scheme  has  a  lossless  join  if  and  only  if  one  of  the  relation  schemes  is  a  key 
of  the  universal  scheme. 

The  proof  of  the  above  characterization  in  [BDB]  is  quite  involved  and  consists  of  a 
detailed  study  of  the  test  for  losslessness  in  [ABU].  In  this  note  we  describe  an  alternative 
proof  for  that  characterization.  The  proof  uses  more  advanced  tools  from  dependency  theory, 
but  is,  we  believe,  much  simpler.  Specifically,  we  use  the  theory  of  embedded  join  dependencies 
[BV1]  and  the  theory  of  tuple  and  equality  generating  dependencies  [BV2.BV3J.  Our  intention 
is  to  demonstrate  how  "higher-level"  notions  from  dependency  theory  can  be  used  to  study  its 
most  fundamental  questions.  We  believe  that  techniques  used  here  couI$  also  be  applicable  to 
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other  problems. 

1  The  Theorem 

We  assume  familiarity  with  the  terminology  and  concepts  of  relational  database  theory  as 
presented  in  [Ma,  U1J.  The  universe  U  is  a  finite  set  of  attributes.  All  attribute  sets  are  subsets 

of  U.  An  attribute  set  collection  (asc)  is  a  set  R  =  {.ff  i . Rk]  of  distinct  attribute  sets.  We 

denote  attribute  sets  by  lightface  letters  and  asc’s  by  boldface  letters.  A  database  scheme  is  an 

it 

asc  R  =  {/?lf . . .  ,Rk}  such  that  (J  R,  =  U.  A  relation  is  a  relation  on  U  unless  explicitly 

<=i 

specified  otherwise.  We  use  7[,Y]  to  denote  the  projection  of  the  relation  /  on  the  attribute  set 
X,  and  *lj  to  denote  the  join  of  the  set  {/,}  of  relations.  We  assume  that  the  relations  we  are 

dealing  with  and,  accordingly,  the  dependencies  that  refer  to  them  are  typed,  that  is,  distinct 
attributes  have  disjoint  domains.  We  also  assume  that  all  relations  are  finite. 

A  Junctional  dependency  (fd)  is  a  statement  X  —  Y,  where  X  and  Y  are  attribute  sets.  It 

is  satisfied  by  a  relation  I  if  for  all  tuples  s  and  i  in  /,  if  j[X]=/[A,J  then  s[T]=r[T]-  A  total 

join  dependency  (tjd)  is  a  statement  *[R],  where  R  is  a  database  scheme  {R\, . . . ,  Rk}.  It  is 

* 

satisfied  by  a  relation  /  if  /  =  *  /[£,]. 

Let  2  be  a  set  of  dependencies  and  let  t  be  a  dependency.  We  say  that  2  implies  r, 
denoted  2 1=  r  if  every  relation  on  U  that  satisfies  all  dependencies  in  2  satisfies  also  r.  A 
database  scheme  R  has  a  lossless  join  with  respect  to  2  if  2  h  *[RJ.  A  set  2  of  fd’s  is  embedd¬ 
able  in  a  database  scheme  R  if  for  each  fd  X-+Y  in  2  there  is  some  R  in  R  such  that 
XYQR} 

Theorem.  [BDB]  Let  2  be  a  set  of  fd’s  and  let  R  be  a  database  scheme. 

(1)  If  R  has  a  lossless  join  with  respect  to  2,  then  there  is  some  R  €R  such  that  2  R  -*■  U, 


1  Foflowini  conveniioo  we  denote  set  union  by  juxtaposition. 
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(2)  If  2  is  embeddable  in  R  and  there  is  some  R  6R  such  that  Z£R-+U,  then  R  has  a  loss* 

less  join  with  respect  to  2.  ■ 

3.  The  Necessary  Condition 
3.1.  Preliminary  Definitions 

We  prove  the  necessary  condition  via  an  excursion  through  the  theory  of  tuple  and  equal¬ 
ity  generating  dependencies.  We  need  first  to  review  some  definitions. 

A  valuation  is  a  mapping  on  the  domain  of  which  tuples  and  relations  are  constructed. 
We  can  extend  the  definition  of  a  valuation  to  tuples,  in  a  component-wise  manner,  and  to  rela¬ 
tions,  in  a  tuple-wise  manner. 

Equality  generating  dependencies  generalize  fd’s.  An  equality  generating  dependency 
(egd)  says  that  if  some  tuples,  fulfilling  certain  equalities,  exist  in  the  database,  then  some 
values  in  these  tuples  must  be  equal.  Formally,  an  egd  is  a  pair  <(aha^J>,  where  /  is  a  rela¬ 
tion  and  a\  and  a2  occur  in  /.  A  relation  J  satisfies  <(a\,a]),l>  if  for  any  valuation  A  such 
that  A(/)€7  we  have  h{a{)~h{a-^.  Note  that  if  ai  =  a2  then  <(a1,a2),/>  is  trivially  satisfied  by 
every  relation. 

Lemma  1.  [BV3]  Let  A  be  a  valuation  and  let  <(ai,a^),I>  be  an  egd.  Then 

<(a1,a2),/>  h <(A (ai),A (aj)),h (/ )>.  ■ 

Total  tuple  generating  dependencies  generalize  tjd’s.  A  total  tuple  generating  dependency 
(ttgd)  says  that  if  some  tuples,  fulfilling  certain  equalities,  exist  in  the  database,  then  another 
tuple,  whose  values  are  taken  from  these  tuples,  must  also  exist  in  the  database.  Formally,  a 
ttgd  is  a  pair  <w,/>,  where  /  is  a  relation  and  tv  is  a  tuple  whose  entries  occur  in  /.  A  rela¬ 
tion  J  satisfies  <w,I>  if  for  any  valuation  A  such  that  h(I)QJ,  we  have  that  h(w)€J.  Note 
that  if  w€7,  then  <w,I>  is  trivially  satisfied  by  every  relation. 

Fd‘s  and  gd’s  can  be  viewed  as  a  special  case  of  egd’s  and  ttgd's,  respectively.  Consider 
the  fd  X-*A,  where  AtX.  We  define  an  egd  tx-a  =<(aO,al)J>  as  follows:  J  consists  of 
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two  tuples  u  and  v  such  that  u[X)=v[X],  u{5]^vl£]  for  all  BIX,  u[/<]=aO,*and  v[^]=al. 
We  leave  it  to  the  reader  to  verify  that  a  relation  K  satisfies  X-*A  if  and  only  if  it  satisfies 
tx—a-  Consider  the  qd  *[R],  where  R  =  {/?b  ....  /?*}.  We  define  a  ttgd  rR=<w,/>,  where 

w  is  an  arbitrary  tuple,  7  =  {w1( - w*},  */[/?;]=*[/?;],  and  if  /f€R,  then  w,[/4]  has  a 

unique  occurrence  in  7.  It  is  shown  in  [ASU]  that  a  relation  K  satisfies  *[R]  if  and  only  if  it 
satisfies  rR. 

Example  1.  Let  U  -  ABCD .  Let  7  and  J  be  the  relations: 

A  B  C  D  ABCD 

I:  aO  bO  cl  dO  J:  aO  bO  cO  dO 
aO  bl  cO  dl  al  bO  cO  dl 

Let  u  be  the  tuple: 

ABCD 
aO  bO  cO  dO 

Let  rx  be  the  ttgd  <«,/>.  Then  rx  is  equivalent  to  the  tjd  *[ABD,AC).  Let  r2  be  the  egd 
<(aO,al)y>.  Then  r2  is  equivalent  to  the  fd  BC-+A .  ■ 

An  algorithm  for  testing  implication  of  egd’s  and  ttgd’s,  called  the  chase,  was  presented  in 
[BV2],  following  earlier  algorithms  for  testing  implications  of  fd’s  and  tjd’s  [ABU.MMS].  We 
present  here  the  special  case  where  the  implying  dependencies  are  functional.  Without  loss  of 
generality  we  assume  that  the  given  fd’s  have  a  single  attribute  on  their  right-hand  side.  A 
chase  of  a  relation  I  by  a  set  I  of  fd's  is  a  maximal  sequence  of  distinct  relations  /q ,I\,  •  *  - 
such  that  7  =  /0  and  7„+1  is  obtained  from  7„  by  an  application  of  a  chase  rule.  To  each  fd  in 
X  there  corresponds  a  chase  rule; 

FD-rule  (for  an  fd  X-*Y  in  2):  7„+1  is  obtained  from  7„  by  identifying  all  occurrences  of 
u[A)  with  all  occurrences  of  v[A ],  for  some  tuples  u  and  v  in  7„  such  that  u[X]= vpf]. 

To  make  the  FD-rule  unambiguous,  we  assume  that  the  domain  of  values  for  each  attri¬ 
bute  is  totally  ordered  and  whenever  two  values  are  identified,  the  greater  is  identified  with  the 
smaller.  Given  <w,I>,  we  take  w(A)  as  the  smallest  value  in  the  domain  for  A.  (We  can 
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always  rename  the  values  in  w  and  7  so  that  this  is  true).  Similarly,  given  we  take 

a\  as  die  smallest  values  in  the  domain  for  A  and  a2  as  the  next  smallest  Thus,  the  values  in 
w  or  flj  do  not  change  in  the  chase  and  a 2  can  be  identified  only  with  ai.  It  is  shown  in  [BV2] 
that  all  chases  of  /  by  2  are  finite  and  they  have  the  same  final  relation,  denoted  chase^I). 

To  trace  the  tuples  of  /  in  the  chase  we  adjoin  to  each  tuple  an  ordinal  number,  that  is 
7  =  {w1, ....  wk}.  The  ordinal  numbers  do  not  change  during  the  computation  of  the  chase, 
though  the  values  in  the  tuples  may  change  by  the  FD-rules.  Thus,  /„  consists  of  the  tuples 
wf, . . . .  w£  (not  necessarily  distinct),  which  correspond  to  the  tuples  wh  ...  ,wk  of  /.  We 
denote  by  w',  the  tuple  in  chase  j(7)  that  correspond  to  wt. 

Lemma  2.  [BV2]  Let  2  be  a  set  of  fd’s  and  let  7  ={w1, ....  wk}  be  a  relation. 

(1)  2  t=<w,/>  if  and  only  if  w€cAasej(7). 

(2)  2  ],wj[A  D,7>  if  and  only  if  w'i[A]= w 'j[A\.  ■ 

3.2.  The  Proof 

Suppose  that  R ={/?!, . . . ,  Rk}  has  a  lossless  join  with  respect  to  a  set  2  of  fd’s.  That 
is,  2j=*[R],  If  l/€R,  then  the  claim  of  the  theorem  is  trivially  satisfied.  So  assume  that 
UWL  Let  tr=<w,7>.  That  is,  7  =  {wj, . . . .  w*},  w, [£,]=*[/{,],  and  if  A€R/  then  w/M 
has  a  unique  occurrence  in  7.  Since  2  ^  *[R],  also  2  |=rR.  By  Lemma  2,  there  is  some  i  such 
that  w'i  =  w.  That  is,  w't[A]ssw[A]  for  all  AiRi.  By  Lemma  2  we  have  that 
2}=<(w,M],wM]),7>,  for  all  AIR,.  Let  rRl->A  =<(aO,al)J>.  That  is,  J  consists  of  two 
tuples  u  and  v  such  that  */[/?/]=  v[^/],  «[£]^v[£]  for  all  BtRt,  u[/f]  =  aO,  and  v[£J=al. 
We  define  a  valuation  h  such  that  A(w/)=u  and  h(wj)=  v  for  jzdi.  It  is  easy  to  see  that  h  is 
well  defined,  A(7)=7,  Mw<M])=aO  and  h(w{^])=al.  By  Lemma  1, 
<(w4[/1],w(/<P,7>)stj,(--<.  Il  follows  that  2 for  all  A€R,,  and,  consequently, 

2M,-*C7. 


4.  The  Sufficient  Condition 


4.1.  Preliminary  Notions 

We  prove  the  sufficient  condition  via  an  excursion  through  the  theory  of  join  dependen¬ 
cies.  We  need  first  to  review  some  definitions. 

* 

For  an  asc  R the  attribute  set  of  R,  dennoted  a//r(R),  is  (J  R,.  A  join 

1=1 

dependency  (jd)  is  a  statement  *[R],  where  R  is  an  asc  {Rj. . . . .  Rk}.  (We  also  use  the  nota- 

k 

tion  ....  R*].)  It  is  satisfied  by  a  relation  I  if  7[o//r(R)]=  *  /[/?,].  Note  that  a  tjd  is  a 

/=i 

sepcial  case  of  a  jd.  Jd’s  are  called  in  [MMS]  embedded  jd’s.  Let  R  and  S  be  asc’s  such  that 
aur(R)=attr(S).  We  say  that  S  covers  R,  denoted  R<S,  if  for  all  R  ER  there  is  some  S€S 
such  that  R  CS. 

Lemma  3.  [B  VI]  Let  X,  Y,  and  Z  be  attribute  sets,  and  let  R  and  S  be  asc’s.  Then 

(1)  0h'in 

(2)  If  R<S,  then  *[R]h*[S]. 

(3)  If  a«r(S)€R,  then  {*[RJ.  *(S]}  N  *[R -  (a«r(S)}USJ. 

(4)  X~>Yf[XYJCZ\.n 

4.2.  The  Proof 

It  is  shown  in  [BB]  that  if  1)=R-*U,  then  there  there  is  a  sequence 

Xi~+Y], . . .  ,X„-*Y„  of  fd’s  from  2  with  the  following  property:  Let  Rq=R  and 
R(  =  Rj_]T,,  1^/ </i.  Then  X/QR/.i,  for  and  R„  =  U.  We  show  by  induction  on 

/  that  2  Mi- 

Basts  0  =0).  By  (1)  of  Lemma  3,  we  have  2  ]=  *]/?]. 

Induction.  Suppose  that  2)=*tR,.Yiyi, . . .  Since  XtQRt-\,  we  have  that 

by  (4)  of  Lemma  3.  Since  RU(U  T;T/)=R/_1. 
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2 1=  *[R  ,*1  Yx. . . . .  X,  Yt]  by  (3)  of  Lemma  4. 

In  particular  we  have  2  *[R,X\Y\f . . .  ,X„  Y„ ].  But  by  assumption  R €R  and  2  is 

embeddable  in  R,  so  {R  ,X\  Yj, . . . ,  X„  y„}<R.  It  follows  that  2  |=  *[R]  by  (2)  of  Lemma  4. 
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